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Abstract. We revisit the mass properties of the lensing cluster of galaxies MS2137-23 and assess the mutual 
agreement between cluster mass estimates based on strong/weak lensing, X-rays and stellar dynamics. We perform 
a thorough elliptical lens modelling using arcs and their counter-images in the range 20 < R < 100 kpc and weak 
lensing (100 < R < 1000 kpc). We confirm that the dark matter distribution is well consistent with an NFW 
profile with high concentration c ~ 11.7 ± 0.6. 

We further analyse the stellar kinematics data of Sand et al. (2004) with a detailed modelling of the line-of-sight 
velocity distribution of stars in the cD galaxy and quantify the small bias due to non-Gaussianity of the LOSVD. 
After correction, the NFW lens model is unable to properly fit kinematical data and is a factor of ~ 2 more 
massive than suggested by X-rays analysis (Allen et al. 2001). 

The discrepancy between projected (lensing) and tridimensional (X-rays,dynamics) mass estimates is studied by 
assuming prolate (triaxial) halos with the major axis oriented toward the line-of-sight. This model well explains 
the high concentration and the misalignement between stellar and dark matter components (Aip ~ 13°). 

We then calculate the systematic and statistical uncertainties in the relative normalization between the cylindric 
M 2 (< r) and spherical Ms(< r) mass estimates for triaxial halos. These uncertainties prevent any attempt to 
couple 2D and 3D constraints without undertaking a complete tridimensional analysis. Such asphericity/projection 
effects should be a major concern for comparisons between lensing and X-rays/dynamics mass estimates. 

Key words. Cosmology: Dark Matter - Galaxies: Clusters: General, MS2137 - Galaxies: Elliptical and Lenticular, 
cD - Cosmology: Gravitational Lensing - Galaxies: kinematics and dynamics 


1. Introduction 

The issue of the late non-linear evolution of cosmic struc¬ 
ture is essentially addressed via large N-body cosmolog¬ 
ical simulations. It is important to test their validity by 
comparing the small scale matter distribution to numeri¬ 
cal predictions. Two observations act as key-tests for the 
CDM paradigm: the mass distribution of dark matter ha¬ 
los (radial density profile and triaxiality) and the abun¬ 
dance of sub-halos within main halos. This work focuses 
on the former issue. 

Most CDM simulations predict a universal profile of 
the general form: 

p{r/r s ) = p s (r/r s )~ a (1 + r/r s ) a ~ 3 , (1) 

with an inner slope a ranging between a — l and a = 1.5 
(Navarro et al. 1997; Moore et al. 1998; Ghigna et al. 
2000; Jing & Suto 2000). The parameters r s and p s can 
be related to the halo mass (Bullock et al. 2001; Eke 
et al. 2001). Though more recent simulations propose a 
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slightly different universal analytical form (Stoehr et al. 
2002; Navarro et al. 2004). 

The global agreement between observations and simu¬ 
lations is subject to controversy. The inner slope of dark 
matter halos of low surface brightness (LSB) dwarf galax¬ 
ies as inferred from rotation curves tends to favor soft 
cores with a < 0.2 ( e.g. Salucci 2001; de Blok et al. 2003; 
Gentile et al. 2004), leading to the so-called cusp-core de¬ 
bate. Many observations have focused on LSB galaxies 
because their baryonic content can be neglected and the 
dark matter distribution in the halo shall match simula¬ 
tions. However, departs from axisymmetry (triaxial halos) 
make the interpretation of rotation curves more complex 
and could reconcile observations and CDM predictions 
(Hayashi et al. 2004). The question of the very central 
mass profile on dwarfs scales is still open... 

Recently a similar discrepancy at clusters of galax¬ 
ies scales is claimed by Sand et al. (2002, 2004, here¬ 
after Sa04). Using HST images (allowing the modelling of 
strong gravitational lensing configurations) together with 
Keck spectroscopy (providing the radial velocity disper¬ 
sion of stars in the central cD galaxy of the cluster, the 
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BCG) on a sample of six clusters, these authors found 
that the inner slope of the dark matter halo must be 
significantly flatter than that measured in simulations. 
Typically, on a subsample of three clusters with radial 
arcs, they found an inner slope a = 0.52 ± 0.05 ( 68 %CL). 
This result takes advantage of the joint constraints pro¬ 
vided by lensing and stellar kinematics. However, the lens- 
ing part of the analysis of Sa04 has been independently 
discussed by Bartelmann & Meneghetti (2004) and Dalai 
& Keeton (2003) because they did not take into account 
the lens ellipticity when using the critical lines radii as 
a constraint on the density profile. These two latter au¬ 
thors found that the mass profile is consistent with a NFW 
model. The analysis of Sa04 couples 2D projected (from 
lensing that deal with mass enclosed in the cylinder of 
radius R) and 3D tridimensional (from stellar dynamics 
which project an indirect information on the mass en¬ 
closed in the sphere of radius r) mass estimates. 

Comparing lensing and X-rays cluster mass estimates 
is another way to couple 2D and 3D mass constraints. 
The overall agreement between X mass and the mass en¬ 
closed in the Einstein radius of clusters are been addressed 
by various authors (Miralda-Escude & Babul 1995; Allen 
1998; Wu 2000; Arabadjis et al. 2004; Smith et al. 2005). In 
most cases, slightly depending on the presence of cooling 
flows or the degree of relaxation of the cluster, strong lens¬ 
ing mass estimates are often larger by a factor > 1.5. With 
better S/N data, there is an increasing evidence 
that the assumption of spherical symmetry starts 
being oversimplistic and may play a important role 
in this systematic trend (Piffaretti et al. 2003; De 
Filippis et al. 2005; Oguri et al. 2005; Hennawi 
et al. 2005). 

In this paper, we focus on the density profile of the 
cluster MS2137-21 which is part of the Sa04 sample and 
search for further evidence for triaxiality in this peculiar 
cluster. In Sect. 2 we present the strong and weak lensing 
modelling of MS2137 with a NFW model and show that 
it is consistent with all the lensing data at hand from 10 
kiloparsecs to 1 megaparsec. In Sect. 3 we develop a de¬ 
tailed method for the analysis of stellar kinematics and 
apply it to the best fit NFW model derived in the pre¬ 
vious section. We then discuss the overall agreement be¬ 
tween lensing mass estimates and the constraints from the 
stellar kinematics and X-rays observations of Allen et al. 
(2001). In Sect. 4 we investigate the origin on the system¬ 
atic overerestimate of lensing mass estimates as compared 
to that of 3D analyses, and show that the tridimensional 
shape of halos (prolate, triaxial) is likely to explain such 
discrepancies. In Sect. 5 we calculate the statistical prop¬ 
erties of the relative normalization between 2D and 3D 
mass estimates of triaxial halos. We discuss our results 
and conclude in Sect. 6 . 

Throughout this paper, we assume a f l m = 0.3, 
Da = 0.7 and H 0 = 70 ft .70 km s -1 Mpc, leading to the scal¬ 
ing 1" = 4.59 hjQ kpc. 


2. Lens modelling 

2.1. Optical data and x 2 definition 

In this section we focus on the density profile model¬ 
ing using lensing constraints only. The lens properties 
of the cluster of galaxies have been extensively studied 
(Fort et al. 1992; Mellier et al. 1993; Miralda-Escude 1995; 
Bartelmann 1996; Hammer et al. 1997; Gavazzi et al. 2003; 
Sand et al. 2002, 2004; Dalai & Keeton 2003). The clus¬ 
ter’s redshift is Zi = 0.313 and both radial and tangential 
arcs lie at z s = 1.501 (Sand et al. 2002), leading to the 
critical surface density £ cr ; t = 2.39 x 10 9 ft .70 M 0 kpc -2 . 

Our analysis builds on the previous work of Gavazzi 
et al. (2003) (Hereafter G03). More precisely, we use 26 
multiple conjugate knots in the tangential and radial arcs 
systems. The method and the knots location are presented 
in G03. Here, we inflate the uncertainties on knot 
positions in order to account for possible bad as¬ 
sociations. Basically, the mean positional error is 
raised to the more realistic value cr x = 0"3. After 
a more detailed analysis of images, the G03 value 
= 0.18" turns out to be underestimated. Moreover 
there was a mistake in the calculation of error bars 
for model parameters in this earlier paper. The un¬ 
certainty on each knot location is just increased by 
the same amount, so we do not expect any change 
in the best fit model. Dalai Keeton (2003) pro¬ 
ceeded in the same way by inflating the G03 errors 
up to a value of 1" which is far too much. We stress 
that the error bars of G03 on the best fit param¬ 
eters are larger than the ones we shall present in 
the following although they assumed smaller un¬ 
certainties on knots locations. This is a clear evi¬ 
dence for an error in the analysis. The present up¬ 
dated results should be considered as correct. We 
also exclude constraints from the fifth central demagni- 
fied image reported in G03 since its detection is marginal 
and is not confirmed by Sa04. We use a personal ray¬ 
tracing inversion code which includes many aspects of the 
lensmodel software (Keeton 2001a,b). In particular, we 
adopt the same source plane x 2 rc definition. 

In addition, we simultaneously include weak lensing 
constraints also presented in G03. The catalogue of back¬ 
ground “weakly lensed” galaxies comes from VLT/FORS 
and VLT/ISAAC images for which we were able to derive a 
good estimate of photometric redshifts using UBVRIJK 
bands. We fully compute the likelihood as a function 
of model parameters (Schneider et al. 2000; King & 
Schneider 2001). 


JVbg 

Cwl=Y\_P( e i) ( 2 ) 

i=l 


where is the observed ellipticity of the background 
galaxy. We have 


P(ei) 


p s {e s {ei,gi )) 


de s 

de* 


( 3 ) 
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e s being the source ellipticity and gi = g{Gi,Zi) is the 
reduced shear. See Geiger & Schneider (1998) for the de¬ 
scription of the relation e s (e,;, g) and for the corresponding 
transformation Jacobian. Ellipticities are measured on the 
I band image. We improved the previous analysis of G03 
and built a new PSF smearing correction pipeline based 
on the KSB method (Kaiser et al. 1995) but leading to 
a better weighting scheme (Gavazzi et al. 2004). We fully 
take into account redshift information, either photomet¬ 
ric 1 for weak lensing or spectroscopic for strongly lensed 
arcs. The use of photometric redshifts to select the sample 
of background galaxies avoids the problem of contamina¬ 
tion by foreground unlensed galaxies (Broadhurst et al. 
2005). The global y 2 for lensing is : 

xL s = xL - 21n£ wi • (4) 

We use the minuit library 2 to minimize this y. The 
error analysis is performed using both minuit facilities 
and Monte-Carlo Markov Chains (MCMC) based on the 
implementation of Tereno et al. (2005). We chose to use 
MCMCs because minuit has difficulties to draw Ay 2 con¬ 
tours in the parameter space when there are strong degen¬ 
eracies. In order to fasten the convergence of the chains, we 
previously run many minuit optimizations starting from 
a broad range of initial conditions. From the well defined 
best fit minimum position, we started up to five chains 
with each of them ending with 6 x 10 4 relevant iterations. 
The convergence assessment was done in the same way as 
in Tereno et al. (2005). 

2.2. Luminosity profile 

We fitted the central cD surface brightness on the 
F702/WFPC2 Hubble Space Telescope image (Hammer 
et al. 1997) with a general projected stellar density profile 
assuming that all stars have a constant mass-to-light ratio. 

We assumed the following analytic expression for 
the three-dimensional radial distribution of stars : 

p*{r) = p sir x~ s {l + x) s ~ 4 , (5) 

where x = r/r s *, r s * is a scale radius. We consid¬ 
ered the particular values 5 = 1 for the Hernquist 
profile Hernquist (1990) and 6 = 2 for the Jaffe 
profile Jaffe (1983) which were added to the 2 
dimensional galaxy model fitting software galfit 
(Peng et al. 2002). Before fitting the luminosity 
profile is convolved by the HST/F702 PSF. The 
Hernquist fit gives an axis ratio q * = b/a = 0.83 ± 0.12, 
a scale radius r s * = 11.1 ± lffih^kpc = 2'.'4 ± O'/l and 
a reduced y 2 /dof = 10.2. This latter value could be no¬ 
ticeably decreased by taking into account a rotation of 
the major axis position angle within the inner 3 arcsec 
(see Fig. 2 of G03). We also tried to fit a Jaffe pro¬ 
file as proposed by Sa04 but we found a much worse 

1 using hyperz facilities (Bolzonella et al. 2000), see also G03 

2 http://cernlib.web.cern.ch/cernlib/ 


y 2 /dof = 99.0. However, Sa04 found that the stellar 
density profile is well fitted by a De Vaucouleur 
model which is bracketed by the Hernquist and 
Jaffe models. Therefore we consider that the stellar com¬ 
ponent is well modeled by an Hernquist density profile 
with r s * = 11.1 h^Q kpc and a total rest frame V band 
luminosity L v = 4.77 ± 0.40 x 10 11 hf 0 2 L 0 (Sa04). The 
mass content in stars is M* = 2np st r^q* = YyLy where 
Yy is the rest-frame V band stellar mass-to-light ratio. 

Throughout the paper, we shall discuss the con¬ 
sequences of this particular choice. At this level, 
we expect the effect of this choice to be more 
important for stellar kinematics than for lensing. 
This can be understood because strong lensing 
constraints probe the total density profile well 
beyond the stellar scale radius where Hernquist, 
Jaffe or De Vaucouleur profiles are very similar 
(see Fig. 2 of Sa04). 

2.3. NFW dark matter density profile 

The lens is decomposed into two components. The cD stel¬ 
lar component is modeled by the elliptical Hernquist pro¬ 
file of the previous section. The stellar mass-to-light ratio 
Yy is treated as a free parameter with a broad uniform 
prior 1.5 < Yy < 10 whereas the scale radius, orientation 
and ellipticity are fixed by the observed light distribution. 
The dark matter halo is modeled with an elliptical NFW 
density profile: 

Pdm(^) = Ps {r/rs)- 1 (1 + r/r s )~ 2 . (6) 

The lens properties of such a density profile are presented 
in (Bartelmann 1996). We used numerical integrations al¬ 
gorithms for the elliptical 3 Hernquist and NFW density 
profiles (Keeton 2001a). The model has five free parame¬ 
ters : 

— the dark halo scale radius : r s , 

— the characteristic convergence : n s = p s r s /E cl it, 

— the dark halo axis ratio : q = b/a, 

— the dark halo major axis position angle : ipo, 

— the stellar mass-to-light ratio : Yy = M*/Ly. 

Fig. 1 shows the best fit NFW strong lensing configura¬ 
tion. One can see the source and image planes with caustic 
and critical lines together with the location of the 26 mul¬ 
tiply images knots. Every observed point (circle) is well 
reproduced by the model (+ signs). The source associated 
to the tangential arc is clearly crossing the corresponding 
caustic line (inner astroid) whereas one can only see the 
part of the source associated to the radial arc that is inside 
the radial caustic (since the part outside the caustic is not 
multiply imaged and is useless for modelling). The central 
image associated to the tangential system is plotted but is 

3 which are not approximated by elliptical lens potentials 
(numerically faster but leading to unphysical surface mass den¬ 
sity at large radius) 
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Fig. 1. NFW best fit model for strong lensing constraints. Left: Caustic lines with the position of the 26 knots in the source plane. Right: 
Critical lines with the observed (resp. model) position of the 26 knots represented with circles (resp. + signs). Geometrical units are arcsec. 


not taken into account in the modelling. Besides the crit¬ 
ical lines location (the only constraint used by Sa04), our 
model also remarquably explains the position of counter¬ 
images, the azimuthal configuration, the length and width 
of arcs. 

The model requires a rest frame V band stellar mass- 
to-light ratio Yy = 2.5 ± 0.4. This value is in good 
agreement with expectations of evolution of 2 < t < 4 
Gyr old stellar populations. The reason why the stel¬ 
lar mass content is so tightly constrained is that 
the stellar and dark matter components are not 
aligned. There is a position angle misalignment 
of A'lp = 13°. This was first pointed out by G03. 
Otherwise, there would be a degeneracy between 
the relative contribution of dark matter and stars. 
Here the degeneracy is broken though the contri¬ 
bution of stars is subdominant at all scales (and 
a factor ~ 2 at the centre) as shown in Fig. 3. 
This situation well explains the small inaccuracy 
in the radial arc modelling highlighted in Sect. 4.2 
of G03. By adding a small misaligned contribution 
under the form of stars at the center, one is able to 
twist to isopotentials and precisely reproduce the 
radial arc and its counter-image. See Romanowsky 
&: Kochanek (1998) and Buote et al. (2002) for a 
similar example. We shall turn back to this issue 
in Sect. 4.4 and appendix B. It is worth mention¬ 
ing that changing the stellar mass profile to a Jaffe 
model does not make differences. The total (mis¬ 
aligned) stellar mass is well fixed by lensing. 

In Table 1 we present the best fit NFW model param¬ 
eters in terms of more physical quantities like the virial 
radius r 2 oo, the concentration parameter c = T 2 oo/r s or 
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Fig. 2. Scatter plot showing the projection of MCMCs on some planes 
of the parameters space of the NFW lens modelling. The color codes 
for the ellipticity parameter according to the scale at the bottom right 
corner. The axes are the virial mass M 200 , the concentration c and the 
rest-frame V band stellar mass-to-light ratio Ty = M*/Ly. 


the virial mass M 200 that all derive from k s , r s and q. 
At the best fit parameter set, the minimum y 2 value 
is y 2 /dof = 4931.20/4965 = 0.993. When consid¬ 
ering strong lensing constraints only, y 2 rc /dof = 
76.4/130 = 0.59 showing that both strong and weak 
lensing observations are well modeled 4 . The (SL) 
and (SL+WL) columns detail how the best fit model is 
changed whether weak lensing constraints are added to 
the model or not. Basically, errors are just reduced and 
no significant change in the best fit parameters value is 

4 If we have used the former positional uncertainties of G03, 
the best fit model would not have been changed but we would 
have found a minimum y 2 rc /dof =1.64 which is also a accept¬ 
able fit. 
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(SL) 

(SL+WL) 

(1) 

unit 

Ks 

0.67 ±0.05 

0.66 ± 0.03 

0.30 + 0.15 


r s 

158±i| 

1621“ 

160 ± 30 

hjQ kpc 

r 200 

1.88 ±0.05 

1.89 ± 0.04 

1 QQ+0.49 
-L.oy_o.38 

hjg Mpc 

A /200 

7.56lg;g 

7 <79+0.47 

1 ■ 1 ^-0.42 

- 

10 14 h^M & 

c 

n.92±g : ;i 

11.73 ±0.55 

0 7 +1-2 
-0.9 


Q 

0.774 ± 0.010 

0.777 + 0.007 

- 


tpo 

5.86 ±0.14 

5.88 + 0.13 

- 

deg 

T v 

2.40 ± 0.45 

2.48 ± 0.39 

- 

h 7 0 (M/L)q 


Table 1. Best fit NFW model parameters and their 68% CL uncertainty (marginalized over all the other parameters). (SL) corresponds to a 
model in which weak lensing constraints are ignored whereas (SL+WL) takes both strong and weak lensing constraints into account. (1) refers to 
the CHANDRA X-rays values of Allen et al. (2001). The apparent disagreement between their estimates and ours is discussed in the text. 


observed. Fig. 2 shows the degeneracies between the con¬ 
centration parameter, virial mass, stellar mass-to-light ra¬ 
tio and ellipticity (color-coded). 

Fig. 3 shows the radial projected mass profile 
for the best fit NFW+stellar components as well as 
a detail of the stellar component. The thickness of 
curves is representative of the 1 —a uncertainties. 
This is done by considering many points of the 
MCMCs that stand within the l-er contour about 
the best fit model. 

2.4. A more general profile for dark matter 

The quality of the fit is such that very few departs from the 
NFW model we found are allowed. In order to check how 
restrictive the analytical form (6) is, we also assumed the 
following profile for the dark matter component (Wyithe 
et al. 2001) 

Pdm(j') =p s x~ a (l + a; 2 ) (Q 0)/2 , (7) 

with x = r/r s the radius in units of a scale radius r s and 
two more free parameters: the asymptotic inner and outer 
slopes a and (3 respectively. We will refer to this model as 

the “gen” profile. This model slightly differs from the 
generally assumed generalised gNFW model p oc 
x~ a (1 + x) a 3 and a comparison to previous studies 
is not straightforward. Nevertheless we chose this 
model because it is computationally tractable even 
with elliptical symmetry (Chae et al. 1998; Chae 
2002) and allows another degree of freedom since 
the outer slope is not fixed. 

For the best fit model, we have y 2 /dof = 
4915.90/4963 = 0.990, and xL/ dof = 64.2/128 = 0.50. 
Here again, the y 2 value is satisfying 5 . We found 
Yy = 2.09 ±0.16 also consistent with stellar evolution 
models, /3 = 2.691q; 22 and a ~ l-262+Q;g 55 .The con¬ 
straints on a are very tight and show that lensing is in¬ 
consistent with any soft core a<l. However, it does not 
contradict the NFW behavior p oc r _1 at small scales be¬ 
cause the fast transition 1 ± x 2 in the “gen” profile differs 

5 It would have increased to Xsrc/dof = 1.39 using the G03 
uncertainties. 


from the NFW case (1 ± x). This can clearly be seen in 
Fig. 3 where the projected NFW and “gen” mass profiles 
match over a broad radius range (r < 100 kpc). The dif¬ 
ferences at larger scales are still within the weak lensing 
uncertainties. Consequently, we can faithfully trust the 
radial behavior of the lensing deduced mass profile of the 
NFW model between 10 < R < 1000 kpc. 

Here again, changing the Hernquist stellar pro¬ 
file to a Jaffe model does not change our results. 

2.5. Comments 

Fig. 3 also shows the best fit model of Sa04 which presents 
strong discrepancies with both our NFW and “gen” mod¬ 
els. Though the projected mass at the tangential arc ra¬ 
dius (~ 100 kpc) matches our estimates, the model of Sa04 
is inconsistent with most lensing constraints. They im¬ 
posed the radial critical line to fit the observations but 
their model cannot reproduce the radial arc length and 
its counter-images well, nor the tangential arc width and 
weak lensing at R > 200 kpc. This can be understood 
by comparing the circularly averaged deflection a. profile 
of these models in the upper panel of Fig. 4. This plot is 
used to solve the lens equation graphically. The tangential 
critical radii (intersection of curves a(r) and y = r) are 
consistent from one model to another. As well the curves 
a(r) are tangent to the line y = r + u at the same ra¬ 
dial critical radius. But the intersections of a(r) and this 
line at the opposite side, which give the location of the 
counter-image of the radial arc, significantly differ (~ 2 
arcsec). Moreover, we can see that the Sa04 model pre¬ 
dicts a much more elongated radial arc that could extend 
very close to the lens center. This is clearly excluded by 
the data. It is worth noticing that the Sa04 model predicts 
another radial critical line at the very center (r ~ 0.1") 
6 and globally higher magnifications since a(r) is close to 
the bisectrix y = r. The only models consistent with all 
lensing constraints are the ones similar to the NFW and 
the “gen” models. 

The column (1) of Table 1 also resumes the NFW 
model parameters deduced from Chandra X-rays observa- 

6 which makes the issue of the fifth image worth observation- 
aly addressing. 
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Fig. 3. Projected total mass (stars+DM) profile within cylinder of radius R for the best fit NFW model (black), the best fit general “gen” model 
with cx. = 1.26 and (3 = 2.6 (red), the best fit model of Sand et al. (2002, 2004) (green) and the best fit NFW model from X-rays constraints 
(Allen et al. 2001) (orange). The contribution of stars for the NFW model is the cyan dashed curve. The width on the curves is representative of 
the 1 —<y uncertainties (except for the Sa04 profile). The blue region is the domain constrained by the weak-lensing (^-statistic. The location of the 
tangential (resp. radial) critical lines is reported by a blue “T” (resp. “R”). The “gen” and NFW models match well from the center to the inner 
radius probed by weak-lensing. Beyond, their are consistent within the weak-lensing uncertainties. For R < 40 kpc both profiles present large (a 
factor ~ 2) discrepancies with the curves inferred by Sa04 and A101. See text. 


tions of Allen et al. (2001, hereafter A101). The projected 
mass profile of their model is the orange thick curve on 
Fig. 3. This NFW profile is twice as low as our NFW 
and “gen” models over a range 10 < R < 200 kpc ( i.e. 
the factor 1/2 in the value of k s in table 1). At larger 
scales R > 300 kpc, the A101 mass profile becomes con¬ 
sistent with weak lensing and our models. X-rays (A101) 
and stellar dynamics (Sa04) mass estimates agree at small 
scales R < 50 kpc. 

3. Dynamics of stars in the BCG 

The kinematical properties of stars in the central cD 
galaxy are studied in this section. Instead of using the 
standard Jeans equation to relate the gravitational poten¬ 
tial and the velocity dispersion of tracers, we fully cal¬ 
culate the line-of-sight velocity distribution LOSVD via a 
thorough dynamical analysis which is detailed in appendix 
A. By doing so, we can estimate the biased velocity disper¬ 
sion profile a\ u ( R ) in place of the true velocity dispersion 
of stars erio S (i?) due to the assumed gaussianity of absorp¬ 
tions lines. 


The analysis presented in appendix A shows that de¬ 
parts from gaussianity are kept at a low level for the 
lensing-deduced NFW mass model. For isotropic orbits 
Gaussianity is a fair assumption: (<7h 4 — cqos)/cios ~ —13% 
at R ~ 1 kpc and then decreases whereas departs can 
reach ~ 30% for anisotropic orbits. With this mass model 
we plot ai os (R) and o\ U (R) on the top panel of Fig. 5 
for different values of the anisotropy radius r a = oo and 
r a = 10 kpc. The agreement between the measure¬ 
ments of Sa04 and <7h 4 (-R) is better than with cri os (i?) but 
introducing anisotropy cannot improve the fit quality for 
R ~ 10 kpc: the eri l4 curve of the NFW model raises too 
fast whereas data indicate a declining tendency. However, 
if kinematical data would extend to slighly larger scales, 
we expect the profile to start raising and get closer to the 
model beyond a few tens of kpc as observed in others cD 
galaxies (Dressier 1979; Kelson et al. 2002). 


We attempted to couple lensing and kinematical con¬ 
straints by minimizing the merit function Xtot = xf e ns + 
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Fig. 4. Upper panel: Circularly averaged deflection angle for our best 
NFW fit (solid black curve) compared to that of Sa04 (solid red curve). 
From left to right, the blue vertical lines represent the radial and tangen¬ 
tial critical radii, the Sa04 and our prediction of the radial arc counter¬ 
image. The critical radii match from one model to another but the radial 
arc lenght significantly differs. The dotted y = r line gives the solution 
of the tangentiel critical radius. The dashed lines y = r + u (with u the 
source location) give the solution of the radial critical radius where it 
is tangent to the curve y — r. Lower panel: Magnification profile. The 
Sa04 model predicts high magnifications close to the center with another 
radial critical line at the very center (r ~ 0.1 // ). 


Xkin. With 



0 12 3 


-Nbins 

Xkin = J2 


(^h 4 ] ^rnes.i) 


( 8 ) 


that accounts for kinematical constraints and ,\f ens defined 
in Eq. (4). and o'mes.i are the measurements and er¬ 
rors of Sa04, and iVbi ns = 8. The inferred NFW model is 
marginaly changed as compared to the one found using 
lensing only. The NFW model is overconstrained by lens- 
ing and cannot fit the Sa04 kinematical data better. We 
find x£ in /./Vbins ~ 7 for the best model 7 . After y 2 mini¬ 
mization, the NFW model is still unable to reproduce the 
velocity decline at R > 4 kpc. 

Changing the Hernquist stellar light profile by 
a Jaffe model as proposed by Sa04 slightly im¬ 
proves the fit of kinematical data without altering 
the lens modelling (see Sect. 2.3). In this case, we 
have Xkin/Abins ~ 6.5. The velocity dispersion curve 
raises a bit slower as compared to the Hernquist 

7 We find Xkin/NbinB ~ 8 if we change a h4 by cri os in Eq. 8, 
ie if we neglect the velocity bias due to non-gaussian LOSVD. 
Consequently, the correction has a weak effect on the mod¬ 
elling. 


Log 10 (r) [h 7Q ^pc] 

Fig. 5. Velocity dispersion cr\ os (R ) (solid green line) and biased ve¬ 
locity <Th 4 ( R ) (dashed blue line) profiles for isotropic orbits. The mea¬ 
surements of Sa04 are represented by red boxes. Top: Our best fit 
NFW model. In this panel we have also represented another couple of 
(trios,<Th 4 ) curves (cyan and magenta respectively) which correspond to 
an anisotropic Osipkov-Merritt case with r a = 10 kpc. The introduction 
of anisotropy does not improve the fit quality but leads to huge departs 
between cri os and <Jh 4 . Middle: Best fit model of Sa04 which matches 
the data well. Bottom: Best fit model of A101, inferred from X-rays 
analysis. This model also matches the measurements of SaOl whereas 
none of these latter profiles are consistent with lensing observations. 


case. However, since the mass budget is dominated 
by dark matter, there is not much improvement. 
Lensing constraints are so tight that the allow re¬ 
gion in the parameter space is completely fixed. 

Likewise the “gen” model also fails in reproduc¬ 
ing kinematical data although it has more free 
parameters. In this case, the inner slope a = 
1.250 ± 0.011 is still fixed by lensing. The inferred 
stellar mass-to-light ratio is Tv = 1.83 ±0.14 which 
is a rather low value. For the “gen” profile too, 
switching the stellar mass profile to a Jaffe model 
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does not significantly improve the fit on kinemat- 
ical data. 

We have shown that departs from gaussian absorption 
lines induce a small bias which starts being important for 
dynamical systems with radial orbits. However this bias 
is unable to explain the discrepancy between lensing and 
kinematical mass estimates. Furthermore, such a bias can¬ 
not be advocated to explain the discrepancy between lens¬ 
ing and X-rays mass estimates. We can see on the central 
and bottom panels of Fig. 5 that the mass model of Sa04 
fairly reproduces kinematical data, as well as the A101 
model (provided one addes the contribution of a central 
cD galaxy with Yy ~ 2.5). 

4. Discrepancies between mass estimates 

At this level, let us resume the main problems that arise 
from the previous sections. A detailed lens modelling pre¬ 
dicts a robust projected mass distribution that is consis¬ 
tent with NFW universal profiles. We have used a more 
general density profile for the dark matter halo in order to 
check that any other realistic mass profile should match 
our best fit NFW model over a broad range 20 < R < 
1000 kpc. This family of models turns out to be incon¬ 
sistent with the X-rays and kinematical mass estimates 
that are basically indirect measurements of the 3D mass 
within radius r. These two latter estimates are mutually 
consistent for R < 50 kpc. 

Since lensing is sensitive to the integrated mass 
contrast along the line of sight, it is natural to 
expect overestimates due to fortuitous alignments 
with mass concentrations which are not physically 
related to the main halo of interest. Likewise, de¬ 
parts from spherical symmetry are observed in 
N-body simulations ( e.g. Jing & Suto 2002) and 
may bias lensing estimates. This question has been 
addressed by various authors (Bartelmann 1995; 
Cen 1997; Reblinsky & Bartelmann 1999; Clowe 
et al. 2004; Wambsganss et al. 2004; Hennawi et al. 
2005). Conclusions about the importance of unre¬ 
lated structures (large scale structure LSS) slightly 
differ from one author to another. Hoekstra (2003) 
found LSS to add noise to mass estimates on large 
scales but do not lead to biased estimates since 
on very large scales the skewness of the density 
field is negligible and light rays cross overdense re¬ 
gions as well as underdense ones. At smaller scales, 
this becomes obviously wrong and one expects for¬ 
tuitous alignments of halos to modify the prop¬ 
erties of halos. Wambsganss et al. (2004) claim 
that such effects can increase the lensing mass of 
~ 30 — 40% of halos by a factor of ~ 15 — 20% whereas 
Hennawi et al. (2005) found this effect to change 
the lensing cross-sections of clusters by a smaller 
amount (< 7%). See also (Hamana et al. 2004) and 
(Hennawi &: Spergel 2005) for a discussion of pro¬ 
jection effects on weak lensing cluster surveys. 


On smaller scales, Metzler et al. (2001) found 
the mass of surrounding (sub)structures like fila¬ 
ments to add a significant contribution to the total 
convergence of a cluster-size lens whereas Clowe 
et al. (2004) showed that triaxiality is an impor¬ 
tant issue for lensing mass estimates. In the follow¬ 
ing, we shall focus on this paticular aspects which 
has been found to be important for lensing Oguri 
et al. (2003); Oguri Keeton (2004) and/or X- 
rays observations (Piffaretti et al. 2003; De Filippis 
et al. 2005; Hennawi et al. 2005). 

For a triaxial or oblate/prolate halo, the ratio of the 
mass enclosed in the cylinder of radius R to the mass en¬ 
closed in the sphere of same radius will differ from that of a 
spherically symmetric situation. In order to illustrate pro¬ 
jection effects, we consider an axisynnnetric (either oblate 
or prolate) NFW density profile of the form : 

1 o 2 2 

p{m) = ——-, with m 2 = R 2 + — . (9) 

m(l + m) z q z 

The line of sight is along the z-axis and matches the ma¬ 
jor axis of a prolate halo when q > 1 or the minor axis of 
an oblate halo when q < 1. We can express m = r\(q, fi) 
with /i = cos (9 and A 2 = 1 + g 2 {l/q 2 — 1). Numerical 
simulations predict triaxial halos with minor axis 
and intermediate axes c and b with a distribution 
given by (B.8). With these relations, we can numer¬ 
ically calculate the distribution of q which is close 
to gaussian by approximating q = c/\fb = 0.62 ±0.12 
(resp. q = 1/Vbc = 1/0.64 ± 0.25) for an oblate 
(resp. prolate) halo. This is a rough approximation 
since realistic triaxial halos are not systematically 
aligned with the line of sight but this gives an idea 
of acceptable values of the axis ratio q. . 

Since we are interested in ratios between mass esti¬ 
mates we pay no attention to normalization constants and 
write the exact mass M true (r; q) enclosed by the sphere of 
radius r as 

M tIue (r;q) = J ^jM true (Ar; 1) , (10) 

where M trU e(?*; 1) = ln(l + r) — jjpp for a NFW profile. 
We now calculate the mass Mi ens (r; q) (resp. Mki n , Myj) 
as it would be found from lensing (resp. stellar kinemat¬ 
ics, X-rays) measurements performed assuming spherical 
symmetry. 

4.1. Projection effect on lensing 

Since lensing measures the projected mass along the line 
of sight and owing to the fact that the major/minor axis 
is aligned, the net effect of asphericity is to multiply the 
surface mass density by q. So, we can write 

Afi en s(r; q) = gM true (r; 1) . (11) 

We plot the ratio Mi ens (r; g)/M true (r; q) as a function of 
radius for various values of the axis ratio q on the top 
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panel of Fig. 6. We can observe strong discrepancies for 
extreme values of the axis ratio. Departs between lensing 
and true masses tend to vanish at large scale. Thus, one 
expects a lower effect of asphericity on weak lensing based 
mass estimates. 


4.2. Projection effect on kinematics 


Regarding stellar kinematics, projection effects are much 
more complex because stars are expected to move faster 
along the major axis and boost the mass estimate. 
Assuming a distribution function of stars of the form 
f{E, L z ) and a reduced gravitational potential \E '(R, z) the 
Euler Jeans equations read : 


-d R {wl) + 


R 


= cbj’F 


-d z {vvl) = d z * . 

V 


(12a) 

(12b) 


(Chandrasekhar 1960; Hunter 1977; Binney & Tremaine 
1987). Thus, we can express the components of the veloc¬ 
ity ellipsoid: 


vv%, = vv 2 = — 


d z' vd z 4l 


(13a) 


vRPl 2 = - 


d z' [d R vd x * - d x vd R V] . (13b) 


with i? 2 n 2 = Vq — v 2 R . The main difficulty is to compute 
the potential and its derivatives generated by an ellip¬ 
soidal distribution of mass p(m). To do so we use the 
formalism of Chandrasekhar (1969) (see also Qian et al. 
1995) : 

- J'o = - 2 ttG<? / / d mmp(m) 


*(R, 


A(«) 


d R -4> = - 2tt GqR 


d z 4> = — 27t Gqz 


dit 


A(w)(l 

dw 


+ u) 


(14a) 
p(U) (14b) 

M u ) , (14c) 


Jo A (u)(q 2 + u) 

with 4/o the central potential (which is not relevant for 

our purpose), A (u) = (1 + u)\/q 2 + u and U 2 = AA + 
2 

2 z +m ■ For simplicity we assume that the density of tracers 
v{R,z) does not contribute to the potential (massless). 
We also assume that the density of tracers is ellipsoidal 
v(R, z) = v(m) with the same axis ratio as the dark halo. 

We now calculate the observable luminosity-weighted 
line-of-light velocity dispersion 


Iai os (R) = / d zv(R,z)v 2 z . 


(15) 


If this quantity is assumed to be due to a spherically sym¬ 
metric system and is deprojected according to 


1 


v (r) =- 

7T 


cl I{R) d R 


vv 2 (r) = — 


1 


d R Vi? 2 _ r 2 
dIa\ os (R) d R 


d R 


VR 2 - r 2 


(16a) 

(16b) 


one will calculate a biased radial velocity dispersion v 2 . 
The corresponding biased mass profile M k ; n (r) is given 
by the standard Jeans equation (with isotropic velocity 
tensor): 


M kin (r) = 


v 2 r d In uv 2 
~G dlnr 


(17) 


We plot the ratio Mki n (r-,q)/M tTue (r;q) as a function of 
radius for various values of the axis ratio q on the middle 
panel of Fig. 6. Here again projection effects can be huge 
for extreme values of q. Unfortunately, M k ; n (r; q) depends 
on the profile of tracers v{r) and the details of this figure 
cannot be representative of a general oblate/prolate NFW 
halo. For instance the bump at r < 1 is due to our assumed 
density of tracers which here corresponds to our model of 
MS2137. However, departs between Mki n{r;q) and M trU e 
are always important for high q or 1 jq. 


4.3. Projection effect on X-rays 

Similarly, we calculate the perturbation of asphericity on 
X-rays mass estimates. As compared to lensing or dynam¬ 
ics the effect is expected to be weaker since the gravita¬ 
tional potential is systematically rounder than the mass. 
For simplicity we assume that the gas (with density p g ) is 
isothermal and in hydrostatic equilibrium. 

We have p g oc exp[—T/U 0 2 ] with Vjj = kT / pm p . The 
X-rays surface brightness of the optically thin gas distri¬ 
bution is 

S X (R) oc y exp[—2'F(i?, z)/Vj]Az, (18) 

with 4/(i?, z) given by (14a). Here again, when interpret¬ 
ing this surface brightness distribution as arising from a 
spherically symmetric system, one will deproject Sx(R), 
obtain a biased gas density p g and use it in the following 
equation to obtain the biased mass profile Mx(r). 


M x (r) 


Vjjr d In p g 
G dlnr ' 


(19) 


We plot the ratio Mx(r;q)/Mt rue {r;q) as a function of 
radius for various values of the axis ratio q on the bottom 
panel of Fig. 6. As expected Mx(r; q)/M tIue (r ; q) exhibits 
much less scatter about unity as compared to the two 
previous mass estimates. The asymptotic divergence at 
very large scales (r > 10) is a numerical artefact of our 
crude deprojection algorithm. 


4.4. Comments and application to MS2137 

Fig. 6 clearly shows that moderate values of the axis ratio 
q can lead to strong discrepancies between 2D and 3D 
mass estimates or between lensing and X-rays or stellar 
kinematics. 

It is difficult to fully characterize the ratio 
Mi ens (r-,q)/M) i i n (r;q) because it depends on the dis¬ 
tribution of tracers v( y R 1 z) and is severely sensitive 
to the orientation of the axis ratio relative to the 
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Fig. 6. Radial behavior of ratio between mass estimates for various 
values of the axis ratio q — 0.2, 0.4, 0.6, 0.8, 1., 1/0.8, 1/0.6, 1/0.4, 1/0.2. 
Top Panel: Ratio M\ ens (r\q)/Mtr\ie(j'\q). Middle Panel: Ratio 
Mki n (r] q )/<?)• Bottom Panel: Ratio Mx(r; g)/Mtrue(^» <?)• I n 
this latter case, one can see much lower values whatever the axis ratio. 
The asymptotic behavior is slightly affected by numerical instability for 
r > 10 and extreme values of q. 


line of sight. Therefore a direct comparison between 
lensing and dynamical mass estimates is hazardous. 
Mi ens (r; q)/M^ in (r;q) can have a different radial behav¬ 
ior as a function of radius for a given axis ratio. It can be 
either greater of less than unity. 


Comparing lensing and X-rays mass estimates is easier 
since the X-rays mass estimate is less sensitive to projec¬ 
tion effects. In this respect Mi ens (r;q)/Mx(r’,q) will sys¬ 
tematically be > 1 (resp. < 1) for prolate (resp. oblate) 
halos with a well known radial behavior. 

In the case of MS2137, a prolate halo with q ~ 0.4 
could well explain most discrepancies between our best 
fit models and the results of Sa04 and A101. A prolate 
halo aligned toward the line of sight is a natural expla¬ 
nation for the high concentration parameter we found 
c = 11.73 ± 0.55 and may also explain the high con¬ 
centrations c ~ 22 in CL0024 (Kneib et al. 2003) and 
c = 13.7lJ;i A1689 (Broadhurst et al. 2005). Recently, 

Oguri et al. (2005) have investigated the effect of triaxial¬ 
ity in A1689 and have similar conclusions as well as Clowe 
et al. (2004) who studied numerical simulations (see also 
Piffaretti et al. 2003). 

At this level, it is not possible to simply refine the 
modelling of MS2137, since our prolate model is idealized. 
It should be triaxial and/or not perfectly aligned with the 
line of sight just because the projected density profile is 
elliptical. However the hypothesis of a projected triaxial 
halo also provides a direct explanation for the misalign¬ 
ment between the projected diffuse stellar component of 
the cD and the projected dark matter halo A if) = 13.0±0.5 
deg. Binney (1985) and Romanowsky & Kochanek (1998) 
give the necessary formalism to infer the position angle 
and projected ellipticity of both dark and luminous halos 
from their tridimensional triaxial shape and orientation. 
The information that can be derived from the geometry 
of projected light and dark matter densities is detailed in 
appendix B. Basically, these independent constraints give 
the following results for the orientation 6 (polar angle of 
the major axis with respect to the line of sight), the minor 
axis ratios cdm and c* of dark matter and stellar compo¬ 
nents respectively: 6 = 27.4 ± 5.1, cdm = 0.55 ± 0.08, and 
c* = 0.52 ± 0.12. It is remarquable that this geometrical 
information is fairly consistent with the value of cdm ~ 0.4 
and a perfect alignement (9 = 0) we assumed to explain 
the mass discrepancies. 

There is sufficient material to be convinced that no 
simple coupling between 2D and 3D mass estimates is pos¬ 
sible. Consequently, we expect that most of the previous 
analyses based on such a coupling should be considered 
with caution, either in terms of significance or in terms of 
possibly biased results. 


5. Discrepancies : an expected general trend 

The aim of this section is to predict the statistics of such 
mass discrepancies between lensing and any other mass 
estimate which is not much sensitive to asphericity effect 
like X-rays. Let us now consider a more general situation 
with a triaxial halo p(m) where m 2 = x 2 + y 2 /b 2 + 2 2 /c 2 
with 0 < b < c < 1 and an orientation relative to the line 
of sight parameterized by the polar angles (0, <j>) or the 
unit vector n. 
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The mass within the sphere of radius r is independent 
on the halo orientation and reads 


Mu 


/»27T i * 7T 

’ <r;M = 4 ni iV l “ 


sin 


M 3 (;zr;l,l) 


( 20 ) 


with v 2 = sin 2 $ (sin 2 ip /b 2 + cos 2 ip /c 2 ) + cos 2 i? and 
again Mt rue (i?; 1 , 1 ) is simply the mass within radius r 
for a spherically symmetric halo. 

The mass M 2 within cylinder of radius r will depend 
on their axis ratios and the orientation n but the system is 
equivalent to an elliptical projected mass distribution with 
axis ratio q and position angle Thus we can express M 2 
as: 


M 2 (r; 6 , c, 




dip M 2 [rr/q x , 1) 
2tt t 2 


( 21 ) 


where r 2 = sin 2 ip + cos 2 ip/q 2 and q , q x and / are given 
by Eqs. (B.7) in appendix B. They depend on the intrin¬ 
sic axis ratios and orientation. M 2 (r;l,l) is simply the 
cylindric mass within radius r for a spherically symmetric 
halo. 

As before, an observer measuring the 3D mass profile 
within radius r will find a different normalization as com¬ 
pared to an observer interested in the cylindric mass of 
radius r. They will differ by a factor 


v(R',b, c, n) 


M 2 (r; b, c,w)/M 2 (r; 1,1) 
M t rue(r; b, c)/M true (r; 1 , 1 ) ' 


( 22 ) 


We can now calculate the statistical properties of this ra¬ 
tio by averaging over the b and c PDFs of Jing & Suto 
(2002) given by (B. 8 ) and the orientation of the major 
axis (assumed isotropic). This can be expressed as : 


PR^) = dc P( c ) J dbp(b\c) 

pi T 27T 

/ sin0d0 / d (j>6 [r/ — r](R-,c,b,n)\ . (23) 

Jo Jo 

We plot on the upper panel of Fig. 7 the distribution 
of p for three fiducial values of R = 0.03, 0.3 and 3 which 
are relevant for strong lensing/stellar dynamics, strong 
lensing/X-rays and weak lensing/X-rays comparisons re¬ 
spectively. We clearly see a broad, shifted and skewed dis¬ 
tribution which converges toward unity with increasing 
radius. However, at small scales, the median value of p is 
not unity and readily extends toward high values p > 1.5. 
Typically p = 1.24±°; 2 | (resp. 1.19^;^, 1.07 ± 0.06) for 
R = 0.03 (resp. 0.3, 3.0). Thus, important departs be¬ 
tween M 2 and Mt rU e are naturally expected if halos are 
effectively triaxial. 

Moreover, there must be a correlation between the ob¬ 
served projected axis ratio q and 77 since the apparently 
rounder halos are likely to be elongated along the line of 
sight. This effect can be seen on Fig. 8 where we plot the 
conditional PDFs p(rj), p(p\q > 0.9), p{q\q > 0.7) and 
p{p\q < 0.7) for a radius R = 0.3. The highest values of 




Fig. 7. Upper panel : Distribution for the mass ratio 77 for three dif¬ 
ferent radii r = 0.03 (the broader black curve), r = 0.3 (the intermediate 
red curve) and r — 3.0 (the narrower green curve). At small scales, the 
distribution is broad and clearly not centered on 77 = 1 , leading to un¬ 
reliable direct normalization between 2D and 3D mass estimates like in 
strong lensing and X-rays/stellar dynamics comparisons. With increas¬ 
ing radius (r > 3.0), departs significantly vanish and explain the overall 
agreement between large scale weak lensing and X-rays mass estimates. 
Lower panel : Mean (thick black curve) and 68.3, 95.4 and 99.7 % 
quantiles (thin red, blue and magenta curves respectively) of the 77 dis¬ 
tribution as a function of radius. 



Fig. 8. For r = 0.3, we show the dependence of 77 on the projected 
axis ratio q. The conditional PDFs p(r)\q > 0.9) (blue) p(rj\q > 0.7) 
(red) and p(rj\q < 0.7) (green) are compared to the overall distribution 
p{r\) (black). The rounder the halo is, the more likely is the major axis 
of the halo oriented toward the line of sight and the higher is 77 . 
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77 are due to the roundest projected halos. For instance, 
given q > 0.7 we have 77 = 1.25lo!i3- 

Finally projection effects of triaxial halos have the in¬ 
teresting properties to explain the fact that weak lensing 
and X-rays measurements generally match since 77 ~ 1 for 
r > a few r s . Likewise the general trend for strong lens¬ 
ing mass estimates to generally be greater (by a factor of 
1 — 3) than X-rays since they occur at scales 0.1 < r < 1 
(Allen et al. 2001; Wu 2000). The relative normalization 
between lensing and stellar kinematics is more complex 
and cannot be representated by the 77 statistic. However 
we expect a similar scatter and a strong dependence on 
the major axis orientation. 

6. Discussion & Conclusion 

Regarding the particular case of MS2137, using a detailed 
modelling of both strong and weak lensing data, we have 
shown that the dark matter density profile must be close to 
NFW. See also (Miralda-Escude 1995; Bartelmann 1996; 
Gavazzi et al. 2003; Bartelmann & Meneghetti 2004; Dalai 
& Keeton 2003) for similar conclusions. We have explained 
the reason why the Sa04 lens model is inconsistent with 
lensing data (radial arc counter image, weak lensing...). 
We have highlighted a possible discrepancy between our 
lens model and other mass estimates from stellar kinemat¬ 
ics in the central cD galaxy and X-rays. 

We have undertaken a thorough dynamical analysis of 
the line-of-sight velocity distribution of stars in the cD in 
order to check whether or not departs from gaussianity 
may explain the relative inconsistency between our lens 
models and stellar kinematics. The effect of non-gaussian 
aborption lines is to slightly lower (~ 15%) the measured 
velocity dispersion estimates of Sa04 but does not greatly 
improve the agreement between our lens model and the 
bias-corrected data. Moreover such a bias cannot explain 
the disagreement between our lens model and A101 X-rays 
mass estimates. This latter 3D mass estimates turns out 
to be consistent with stellar dynamics, showing that there 
must exist some problem in the relative normalization of 
2D and 3D mass estimates. 

These discrepancies can be alleviated if one considers 
the possibility of departs from spherical symmetry for the 
dark matter and stellar components. More precisely, we 
have shown that a prolate halo with its major axis oriented 
close to the line of sight and an axis ratio ~ 0.4 is likely 
to explain the discrepancies. This hypothesis is supported 
by the misalignement (At p ~ 13°) between projected DM 
and stellar distributions. 

Furthermore such a geometrical configuration well ex¬ 
plains the concentration parameter we infer from lensing 
analysis c = 11.73±0.55. A prefered elongated halo toward 
the LOS boosts lensing efficiency (Bartelmann 1995; Oguri 
et al. 2003; Clowe et al. 2004) and may explain the high 
concentration of some strong lensing clusters (Kneib et al. 
2003; Broadhurst et al. 2005; Oguri et al. 2005; Hennawi 
et al. 2005). 


We have shown that triaxiality is a general problem 
that hampers any attempt to simply couple 2D and 3D 
mass estimates assuming spherical symmetry. Once pro¬ 
jected, triaxial halos are elliptical and lens modelling is 
able to take ellipticity into account. Usually dynamical 
or X-rays analyses do not fully incorporate such a com¬ 
plexity. This should be an important work to do before 
comparison to (or coupling with) lensing. In Sect. 5, we 
have assumed the statistical distribution of axis ratios pro¬ 
posed by Jing & Suto (2002) in order to calculate the mass 
M 2 (r) within cylinder of radius r and the mass M tlue (r) 
within the sphere of same radius. The difference is im¬ 
portant and can lead to a significant discrepancy in the 
relative normalization between 2D and 3D mass estimates. 

The statistic of ij(r) shows that, at small scales R < 1, 
in average a systematic depart from unity is expected for 
77 with an important scatter and skewness toward high 
values of 77 . Therefore the relative normalization at small 
scales is biased and highly uncertain if one neglects pro¬ 
jection effects. At larger scales, the distribution of 77 con¬ 
verges to unity and explains why weak lensing mass es¬ 
timates are generally in better agreement with X-rays 
or dynamics of galaxies in clusters ( e.g. Allen 1998; 
Wu 2000; Arabadjis et al. 2004). Similarly, the coupling 
between stellar kinematics and strong lensing at clus¬ 
ters scales (Sand et al. 2002, 2004) or at galaxies scales 
(e.g. Koopmans & Treu 2002; Treu & Koopmans 2004; 
Rusin et al. 2003) may be oversimplistic since they do 
not take asphericity into account. First, the mean value 
?7 ~ 1.2 for r < 0.3r s leads to a expected systematic bias, 
but also the ~ 20 % scatter in the distribution of 77 will 
increase the uncertainty in the mass normalization and 
prevent the appealing temptation to couple these inde¬ 
pendent mass estimates. 

In conclusion, the density profile of the dark matter halo 
of MS2137-23 is well consistent with NFW and previous 
claimed discrepancies may be due to the spherical symme¬ 
try assumption. Indeed, it turns out that when coupling 
lensing to other mass estimates we cannot avoid a detailed 
(and cumbersome) 3D triaxial modelling of X-rays and dy¬ 
namical properties. It is worth noticing that such a level of 
refinement is already achieved in lensing studies that as¬ 
sume elliptical symmetry. The triaxiality of dark matter 
halos (and stellar components) is a major concern for joint 
modelling and should systematically be taken into account 
for future analyses. As well, it is possible that X-rays or op¬ 
tically selected clusters are biased toward elongated con¬ 
figurations, leading to an overefficiency for lensing. The 
increasing precision of observations makes the assumption 
of spherical symmetry abusive. Since clusters of galaxies 
are often seen as an important cosmological probe. It is 
important to better characterize their properties (mass, 
temperature, shape, abundance...) with realistic triaxial 
symmetries. 
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Appendix A: LOSVD of stars in the BCG 

The aim of this analysis is to derive the whole line-of- 
sight velocity distribution (LOSVD) of stars from the 
gravitational potential <I>. r denotes the three-dimensional 
radial coordinate whereas R is the 2D projected radius 
and the density of tracers p*(r) (the luminosity density). 
We assume that the distribution function DF /(ryu) can 
be modeled by Osipkov-Merritt (Osipkov 1979; Merritt 
1985) distribution functions (DF) which depend on the 
reduced energy £ = *3 f(r) — v 2 / 2 and angular momentum 
L = rv sin £ through the variable 

+ <a ' 1) 

In these equations 'F(r) = <&(r max ) — $(r) is the reduced 
potential, £ is the polar angle of the velocity direction 
with respect to r and r max is the outermost radius at 
which a particle is bound to the system, i.e. satisfying 
£ > 0. Otherwise specified, we set r max = 2 Mpc in 
the following. r a is the anisotropy radius. Orbits are nearly 
isotropic for r a —> oo and nearly radial for r > r a . 

For Osipkov-Merritt models, the DF f(Q) can directly 
be calculated, through the Eddington formula (Binney & 
Tremaine 1987) 


f(Q) 


i 

\/8tt 2 



d 2 p* dH/ 
clT 2 


1 / d/5* \ 

VQ Vd^/fco 


(A - 2) 

where p* = (1 + yr)p*. 

Once the Eq. (A.2) numerically integrated, it is pos¬ 
sible to derive the LOSVD p(R,v u ) as a function of the 
projected radius R by integrating over the line of sight 
coordinate 2 and over the perpendicular velocity u_l with 
v 2 = v 2 + v\. 


pZ m rV±,m 

p(R,v„) oc / d z v±dvj_ f(Q), (A.3) 

Jo Jo 

with z m the maximum line-of-sight coordinate for a par¬ 
ticle moving at velocity v located at the projected radius 
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R and satisfying d>(^R 2 + z^) = v 2 /2. In the isotropic 
case, Eq. (A.3) can be simplified : 

/■2'i r(R) f : m(») 

Piso(R,v u ) = 2tt / dv 2 d zf(£), (A. 4) 

Jvl Jo 

and a numerical integration is rather fast. However, in the 
general case, this is not possible and we present in the 
following a much faster Monte-Carlo technique. 

The integration of Eq. (A.3) is done by randomly sam¬ 
pling the distribution function with a large number N of 
stars. Since the stellar density profile is known to a scaling 
mass-to-light ratio, one can assign a radius r to each star 
according to the cumulative Hernquist stellar mass profile 

M*(r) = M* (^ r + r ''j ■ Each radius r* can be projected 
onto the plane of sky yielding Ri and Zi, the line of sight 
coordinate as before. At this point, it is trivial to incor¬ 
porate the smearing due to observational conditions like 
seeing by adding a random displacement Ri —* Ri + SR 8 . 
Similarly, if the slit width A is negligible Ri = i?, can 
be identified to the position along the slit, otherwise, it 
is straightforward to split R, into ( Xi,yi ), only consider 
those points satisfying 2\yi\ < A and then identify Xi as 
the position along the slit. This is the situation we shall 
consider in the following. 

This spatial sampling of the DF is thus independent 
of the potential 4/ or the anisotropy radius r a and can be 
stored for further calculation. For a given 4/ (r) and r a , one 
must solve Eq. (A.2), assign a velocity v and a velocity ori¬ 
entation using the calculated DF f(Q). This sampling is 
done with acceptance-rejection technics ( e.g . Press et al. 
1992). See also Kuijken & Dubinski (1994) or Kazantzidis 
et al. (2004) for similar applications. We can write the 
conditional PDFs for the polar angle ( and Q at radius r 


KCk) =2 


sinC 


(l + gsin 2 c) 


3/2 


(A.5) 


p(Q\r) <xf(Q)y/2(*(r)-Q) . 


Hence each star has a position Xi and a line-of-sight ve¬ 
locity v > M) j. It is now possible to calculate p(R, v„) and the 

associated velocity dispersion o\ os {R) = \J~x% and kurtosis 

k(R) = —f-3, respectively related to the second and 

forth order moments of p(R, v M ). 

We now compute the LOSVD deduced from the best 
fit NFW model of MS2137 and compare the inferred ve¬ 
locity dispersion to the measurements of Sa04. We as¬ 
sume the same observational conditions i.e. a slit width 
A = 1.25" ~ 5.8 /iyjj 1 kpc and a gaussian seeing 0.6" = 
2.8 kpc FWHM. These data were obtained by assum¬ 
ing Gaussian absorption lines, van der Marel & Franx 
(1993) showed that departs from Gaussianity imply a bias 

8 where SR may follow a 2D Gaussian distribution with stan¬ 
dard deviation cr see i ng = FWHM/2.35. 


in any velocity dispersion measurement. To the first order, 
the biased pseudo-velocity dispersion a^ 4 reads: 


&h.4 


^los 

1 + k/8 ■ 


(A.6) 


In the Gaussian case (k = 0), cr\ u reduces to <ti os . 

Fig. A.l shows the LOSVD as a function of the line- 
of-sight velocity v u for the innermost and outermost radial 
bins of Sa04. Departs from Gaussianity are visible close to 
the center and decrease with increasing radius. Therefore, 
the velocity bias changes with projected radius as can be 
seen on the top panel of Fig. 5, in which we plot a\ os (R) 
and eri l4 ( R ) for two values of the anisotropy radius r a = oo 
and r a = 10 kpc. 



Fig. A.l. Line-of-sight velocity probability distribution for stars with 
a projected radius in the innermost —0.5 < R < 1.5 hjQ kpc (the upper 
black histogram) and outermost bins 6.5 < R < 8.5 kpc (the lower 
red histogram)of Sa04. In each case, we plot a gaussian distribution with 
the same dispersion for comparison. One can see non-gaussian tails for 
innermost stars. In this exemple, we consider the the best fit NFW model 
of Sect. 2.3 and orbits are isotropic. 


Appendix B: Further evidence for triaxiality 

In this section we follow the formalism of Binney (1985) 
and Romanowsky & Kochanek (1998) and calculate the 
orientation and axis ratio of a projected triaxial dis¬ 
tribution as a function of its intrinsinc 3D axis ratios 
0 < c < b < 1 such that the density p(r) = p(m ) with 
m 2 = x 2 + y 2 /b 2 + z 2 /c 2 . We express the orientation of 
the minor axis with the polar angles (9 , (j>) relative to the 
line of sight. The projected density reads : 

2 Z" 00 

%(x, y ) = -jj J p(u 2 +m 2 )du, (B.l) 


where 










R. Gavazzi: The triaxiality of MS2137-23 


15 


f = sin 2 9 ( cos 2 <j> ± 
1 


sm 


b 2 


cos 2 9 


m 2 = — ( Ax 2 + Bxy ± Cy 2 ) 


A = 


cos 2 9 


cos 


sm 


4> \ sin 2 9 


b 2 


B = cos 9 sin 2</> (l — 1/5 2 ) 


b 2 

1 

72 


C = 


sin 2 cj> 


1 

+ cos^ </>) — . 


The projected distribution is elliptical with an 
g and a position angle %)> given by : 


~ 2 

"x/y 


2 / 


tan = 


A + Ct \JB 2 ± {A - C ) 2 ’ 

q =q v /qx , 

B 


A-C 


(B.2) 
(B.3) 
(B.4) 
(B.5) 
(B.6) 
axis ratio 

(B.7a) 

(B.7b) 

(B.7c) 


These equations are verified by the dark matter and the 
stellar components which have their own axis ratios cdm, 
6dM) c* and 6* but their principal axes are assumed to 
match. Generally, different values of c* and bi lead to 
different values of g* and ?/V This is what we observe 
in MS2137 where the light satisfies g* = 0.83 ± 0.12, 

?/>* = (18 ± 1) deg and our NFW lens modelling yields 
g DM = 0.750 ± 0.005 and 0 D m = (4.90 ± 0.15) cleg. 
Hence we can infer the parameters 9, cj >, Cdm, ^dMi c*, b * 
from these constraints and some additional priors since 
the problem is underconstrained. We can use the axis ra¬ 
tio distribution found in cosmological simulations by Jing 
& Suto (2002): 

i>(c)= 7ik exp (“ l£ ^) • (B - 8a) 

/, n = _3_ / 25 ~ 1 — max(c, 1/2) \ 2 

^ 2(1 — max(c, 1/2)) \ 1 —max(c, 1/2) J ’ 

(B.8b) 

with a c ~ 0.113 and c ~ 0.54. In addition, since the num¬ 
ber of constraints is not sufficient we force the intermedi¬ 
ate axis ratios 6* and 5dm to be equal to the most probable 
value. In other word, we have 


P(b |c) = 6 


1 + max(l/2, c)\ 

2 J ' 


(B.9) 


The best fit with priors yields : 9 = 27.4 ± 5.1, cdm = 
0.55±0.08 and c* = 0.52±0.12. This analysis gives strong 
indications on the reliability of triaxial dark matter and 
stellar distribution with the major axis relatively close to 
the line-of-sight and a value of cdm close to that inferred 
to explain the discrepancy between 2D and 3D mass esti¬ 
mates in Sect. 4.1. 



